All Words Unsupervised Semantic Category Labeling for Hindi

نویسندگان

  • Siva Reddy
  • Abhilash Inumella
  • Rajeev Sangal
  • Soma Paul
چکیده

In the task of semantic category labeling, given a text, every word in it has to be assigned a semantic category. Our language of interest is Hindi. We use the ontological categories defined in Hindi Wordnet as semantic category inventories. In this paper we present two unsupervised approaches namely Flat Semantic Category Labeler (FSCL) and Hierarchical Semantic Category Labeler (HSCL ). The former method treats semantic categories as a flat list, whereas the latter one exploits the hierarchy among the semantic categories in a top down manner. Further our methods use simple probabilistic models, using which the category labeling becomes a simple table look up with little extra computation and thus opening the possibility of it’s use in real-time interactive systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hindi Semantic Category Labeling Using Semantic Relatedness Measures

In this paper, we evaluate and compare six semantic relatedness measures used for Hindi semantic category labeling. Our experiments show that the measure “adapted lesk” performed better than other measures. However, a simple baseline system achieved better accuracy than all the measures.

متن کامل

Towards Building Semantic Role Labeler for Indian Languages

We present a statistical system for identifying the semantic relationships or semantic roles for two major Indian Languages, Hindi and Urdu. Given an input sentence and a predicate/verb, the system first identifies the arguments pertaining to that verb and then classifies it into one of the semantic labels which can either be a DOER, THEME, LOCATIVE, CAUSE, PURPOSE etc. The system is based on 2...

متن کامل

برچسب‌زنی خودکار نقش‌های معنایی در جملات فارسی به کمک درخت‌های وابستگی

Automatic identification of words with semantic roles (such as Agent, Patient, Source, etc.) in sentences and attaching correct semantic roles to them, may lead to improvement in many natural language processing tasks including information extraction, question answering, text summarization and machine translation. Semantic role labeling systems usually take advantage of syntactic parsing and th...

متن کامل

Empty Argument Insertion in the Hindi PropBank

This paper examines both linguistic behavior and practical implications of empty argument insertion in the Hindi PropBank. The Hindi PropBank is annotated on the Hindi Dependency Treebank, which contains some empty categories but rarely the empty arguments of verbs. In this paper, we analyze four kinds of empty arguments, *PRO*, *REL*, *GAP*, *pro*, and suggest effective ways of annotating thes...

متن کامل

The role of orienting attention for learning novel phonetic categories

The current study examines the role of attention in phonetic category formation by experimentally manipulating endogenous orienting of attention. Two native English speaking participant groups were trained with an identical set of novel Hindi words containing unfamiliar consonants produced by multiple native Hindi speakers. Via instructions, the sound-attending group (N=37) was oriented toward ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009